Diffusive behavior and the modeling of characteristic times in limit order executions 
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We present an empirical study of the first passage time (FPT) of order book prices needed to 
observe a prescribed price change A, the time to fill (TTF) for executed limit orders and the time to 
cancel (TTC) for canceled ones in a double auction market. We find that the distribution of all three 
quantities decays asymptotically as a power law, but that of FPT has significantly fatter tails than 
that of TTF. Thus a simple first passage time model cannot account for the observed TTF of limit 
orders. We propose that the origin of this difference is the presence of cancellations. We outline a 
simple model, which assumes that prices are characterized by the empirically observed distribution 
of the first passage time and orders are canceled randomly with lifetimes that are asymptotically 
power law distributed with an exponent Alt. In spite of the simplifying assumptions of the model, 
the inclusion of cancellations is enough to account for the above observations and enables one to 
estimate characteristics of the cancellation strategies from empirical data. 
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I. INTRODUCTION 

Understanding the market micro-structure is crucial for 
both theoretical and practical purposes On double 
auction markets the limit order book contains most of the 
information about the market microstructure and price 
discovery. Recently there has been considerable effort to 
investigate limit order book dynamics. Empirical studies 
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been devoted to the search for the key determinants of 
price formation, the trading process and market organiza- 
tion. A large number of pa per s have focused on modeling 
the limit order book with [U, El M HH, H3 or without 
IH, H3, HH, HH dynamics. Market microstructure studies 
consider a large number of aspects of the price discov- 
ery mechanism and these studies can greatly contribute 
to the success of the modeling of financial markets. The 
market mechanism, along with the complex interactions 
among market participants results in the emergence of 
a collective action of continuous price formation. Some 
of the studies have used an agent based modeling ap- 
proach. Examples are market models described in terms 
of agents interacting through an order book based on 
simple rules [U HH and models where the assumptions 
about the trading strategies are kept as minimal as pos- 
sible [3, [III- One of the most striking findings was 
that even if trends and investor strategies are neglected, 
purely random trading may be adequate to describe cer- 
tain basic properties of the order book [l3[ . 

Most of the above papers focus on limit order execu- 
tions, and very few deal with cancellations, even though 



the frequency of the two outcomes is comparable [i| . The 
uncertainty of execution represents a primary source of 
risk [2{|. Another major risk factor is adverse selection, 
also known as "pick-off" risk. This risk is associated with 
the waiting time until order execution. During this pe- 
riod those with excess information can take advantage of 
the liquidity provided by the limit orders of less informed 
traders, and hence it is important to accurately quantify 
these waiting times. Lo et al. [1] apply survival analysis 
to limit order data, and they find that the time between 
order placement and execution is very sensitive to the 
limit price, but not to the volume of the order. They 
also investigate the dependence on further explanatory 
variables such as the bid-ask spread and the volatility. 
The dynamics of the limit order book has also been in- 
vestigated by using a joint model of executions and can- 
celations in a framework of competing risks 1 . Within 
this approach Hollificld et al. [l||, by using observations 
on order submissions and execution and cancellation his- 
tories, estimate both the distribution of traders' unob- 
served valuations for the stock and latent trader arrival 
rates. Chakrabarty et al. [2!| show that executions arc 
more sensitive to price variation and less to volume vari- 
ation than cancellations. This last work also analyzes the 
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The notion of competing risks applies to problems where one 
deals with several "risks", i.e., random events, of which only the 
first one can be observed |3C| . For example, limit orders are either 
executed or canceled and both events can be modeled by some 
random process. If an order gets canceled, one can no longer di- 
rectly observe what time it would have been eventually executed, 
and vice versa. Thus it is not possible to independently estimate 
cither process without a bias, if one simply ignores information 
from the other one. 
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relationship between execution time and market depth. 

In this paper we aim to go a step further, and combine 
the framework of competing risks with random walk the- 
ory. In particular, we analyze the difference observed be- 
tween the time to fill a limit order, which is the time one 
had to wait before a limit order was executed, and the 
first passage time (3lj . i.e., the time elapsed between an 
initial instant and the time when the transaction price 
crosses a given predefined threshold. In addition, the 
largest difference between our approach and most previ- 
ous studies (e.g., Refs. @, H^]) is that while those placed 
more emphasis on the typical values of execution and 
cancellation times, we will concentrate on the accurate 
description of the rare events, and the related asymptotic 
tail behavior of the distributions. 

We observe that for a fixed price change the first pas- 
sage time distributions of transaction price, best bid and 
best ask are quite well described asymptotically by the 
theoretical form expected for a Markov process with sym- 
metri c jump length distribution (including Brownian mo- 
tion) [3l|,[32|. The empirical time to fill of executed orders 
is smaller than the first passage time. We attribute this 
difference to canceled and expired orders. We propose 
a simple competing risks model, where limit orders are 
removed from the order book when either of two events 
happens: (i) when they are executed, this is modeled 
as the first time when the transaction price reaches the 
limit price, (ii) or when they are canceled, the time hori- 
zon of cancellations is modeled as a random process that 
is independent from price changes. In this framework we 
are able to predict constraints about the tail behavior 
of the time to fill and time to cancel probability densi- 
ties. Our model also allows us to estimate the distribu- 
tion of the time horizons of the placed limit orders. We 
show that the assumption of independence between the 
price changes and order cancellations, while it is a large 
simplification compared to real data, does not affect our 
conclusions significantly. 

The paper is organized as follows. In Section [TT] wc 
describe the investigated market and the variables of in- 
terest. In Section Mil we study the first passage time 
and in Section IIVI the time to fill and the time to can- 
cel. Section [V] describes a simple limit order model and 
Section IVII is devoted to testing the model empirically. 
Section lYlII extends the result to limit orders placed in- 
side the spread. Section IVIIII discusses the validity of 
the assumptions and summarizes the results. Finally, in 
the Appendices wc show that the results are unchanged 
if time is measured in transactions. Then we present a 
critical discussion of the fitting procedure we used to es- 
timate the tail bahavior of the time to fill and time to 
cancel distributions. 



II. THE DATASET 

The empirical analysis presented in this study is based 
on the trading data of the electronic market (SETS) of 



London Stock Exchange (LSE) during the year 2002. 
These data can be purchased directly from the Lon- 
don Stock Exchange. We investigate 5 highly liquid 
stocks, AstraZeneca (AZN), GlaxoSmithKline (GSK), 
Lloyds TSB Group (LLOY), Shell (SHEL), and Vodafone 
(VOD). Opening times of LSE are divided into three pe- 
riods. The intervals 7:50-8:00 and 16:30-16:35 arc called 
the opening and the closing auction, respectively. These 
follow different rules and thus also observe different sta- 
tistical properties than the rest of the trading. Therefore 
we discarded limit orders placed during these times, and 
focused only on the periods of continuous double auction 
during 8:00-16:30. We also removed limit orders that 
were placed during 8:00-16:30 but were canceled (or ex- 
pired) during the opening/closing auctions. We measure 
time intervals in trading time, i.e., we discard the time 
between the closing and the opening of the next day. 2 
Finally, whenever we refer to prices we exclude all trans- 
actions that were executed on the SEAQ market 3 and 
not in the limit order book. 

We denote the best bid price 4 by b(t), the best ask 
price by a(t) and the bid-ask spread is s(t) = a{t) — b(t). 
Except for very special cases, there are already other limit 
orders waiting inside the book when one wants to place 
a new one. Let b(t) — A denote the price of a new buy 
limit order, and a(t) + A the price of a new sell limit 
order. Orders placed exactly at the existing best price 
correspond to A = 0, orders placed inside the spread 
have A < 0, while A > means orders placed "inside the 
book". It is possible to have so called crossing orders with 
such large negative values of A that they cross the spread, 
i.e., A < b(t) — a(t). These orders can be partially or fully 
executed immediately by limit orders from the other side 
of the book. Since a trader would place a crossing limit 
order to execute (at least part of) it immediately, we will 
not consider them as limit orders in our analysis. 

Any limit order which was not executed can be can- 
celed at any time by the trader who placed it. The order 
can also have a predetermined validity after which it is 
automatically removed from the book, this is called ex- 
piry. Wc will not distinguish between these mechanisms 
and we will call both of them cancellation. Throughout 
the paper we will use ticks as units of price and all loga- 



2 In our analyses, we removed the data of trading on September 20, 
2002. This is because on that day very unusual trading patterns 
were observed, including an anomalous behavior of the bid-ask 
spread. 

3 Many studies refer to this colloquially as the "upstairs" market. 

4 In most of the literature the logarithm of the price is modeled, 
while throughout the paper wc intentionally use price itself. Our 
study is concerned with very small price changes on the order 
of the spread, when there is little difference between the two 
approaches. In our case it is important to keep bare prices, as 
stocks have a finite tick size (minimal price change) . Taking bare 
prices enables us to classify the orders into discrete categories by 
price difference. The size of ticks depends on the stock, the 
possible values are 1/4, 1/2 or 1 penny. 
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rithms arc 10-base. 



III. THE FIRST PASSAGE TIME 

Let the latest transaction price of an asset at time 
to = be Sq. The first passage time [3l[ of price through 
a prescribed level So + A with some fixed A > is defined 
as the time t of the first transaction when S(t) > So + A. 
Similarly we can determine the first time after to = 
when the transaction price was below or equal to Sq — A 
and we will consider this time as another, independent 
observation of t. We will call the distribution of the quan- 
tity t the first passage time distribution to a distance A, 
and denote it by Pfpt ; a(£)- 

Such first passage processes have been studied exten- 
sively [33|. For simplicity we will restrict ourselves to 
driftless processes. This is justified, because in real data 
for time horizons t of up to a day the drift of the prices is 
negligible. This means that the ratio \/j,\y/t/a is small (it 
is always less than 10 _1 in our dataset), where \i is the 
mean price change over unit time, and <r is the standard 
deviation of price changes during a unit time (i.e., the 
volatility). Throughout the paper we use real time 5 . 

For the following analysis of empirical data, it is useful 
to review the first passage time distribution for Brownian 
motion without drift. This is can be written as 31 1 
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which is the fully asymmetric 1/2-stablc distribution. For 
any fixed A the asymptotics for long times is 



-Pfpt ; a(£) oc t 



-3/2 



(2) 



A recent study [32| has clarified that this asymptotic 
behavior is valid not only for Brownian motion but also 
for any Markov process with symmetric jump length dis- 
tribution. 6 Of course, real price changes are not de- 
scribed by continuous values, and transactions and order 
submissions are also separated by finite waiting times, 



5 We repeated the statistical analysis with transaction time and 
observed a similar power law decay of the first passage time for 
large times. The value of the power law exponent turns out to 
be different for real time analysis and transaction time analysis. 
See Appendix 1X1 for details. 

6 This result is consistent with the Sparre- Andersen theorem [33l | . 
Alternative descriptions obtained for the asymptotic time de- 
pendence of the FPT of Levy flights which were hypothesizing 
a dependence of the distribution exponent from the index of the 
Levy distribution have missed the fact that the method of im- 
ages, which is extremely powerful in Gaussian diffusion, fails for 
Levy flight processes l32l . The behavior is of course more com- 
plex in the case of Levy random processes described by using a 
subordination scheme. In these cases the asymptotic behavior 
of first passage time depends on the complete properties of the 
subordination procedure |34| . 



which a continuous time random walk formalism could 
take into account [H, [36[. However, in this paper we are 
interested in time intervals much longer than these wait- 
ing times, so the discrete aspects of the dynamics are 
negligible. Thus, we will model prices as if they varied 
continuously in time. 

Let us now investigate empirically the first passage 
time behavior. The first passage time distribution for 
the transaction price, bid and ask when A = 1 tick is 
shown in Fig. [1] for the stock GSK. The distribution 
is obtained by sampling the first passage time at each 
second. One can see that there are no significant differ- 
ences in the behavior of the three prices. Qualitatively, 
the distribution is similar to Eq. {T]), and the long time 
asymptotic of real data seems to decay approximately as 
£-3/2 p or times shorter than 1 minute the curves sig- 
nificantly deviate both from the power law behavior and 
from the prediction of Eq. |T]). We choose to fit the first 
passage time distribution with the function 



FPT 



;A(*) = 



Ct 



- Afpt 



1 + [t/T F PT(A)]- AFPT+A FP 



(3) 



This form, that we will use to fit also the other distri- 
butions introduced below, is characterized by two power 
law regimes. Normalization conditions of Eq. ([3]) im- 
ply that Afpt > 1 and A FPT < 1. For t < T F pt(A) it 
is Pfpt ; a(0 oc i~ A FP T; whereas for t > T F pt(A) it is 
Pfpt ; a(£) £~ Afpt . Wc will discuss the motivations for 
choosing this form in Section IV and in the Appendix. 

Tablc|T]contains the fitted parameters Afpt, A fpt , and 
Tfpt(A) for A = 1, . . . , 4 ticks. The difference between 
the actual values of Afpt and 3/2 from Eq. ([2]) is small. 
Systematic deviations due to clustered volatility or the 
fluctuations of trading activity could not be identified. 
For example, the asymptotic shape of the distribution 
does not change, even if time is measured in transactions 
instead of seconds (see Appendix |A"|) . 

The observation that Afpt < 2 implies that the theo- 
retical mean and standard deviation of the first passage 
time distribution are infinite. Thus one should be careful 
with the interpretation of means calculated from finite 
samples. Throughout the paper we will rely on the de- 
termination of quantiles (e.g., the median) instead, which 
arc always well-defined regardless of the shape of the dis- 
tribution. 

The inset of Fig. [T] shows the median first passage time 
as a function of A for the five investigated stocks. The 
behavior is not exactly quadratic (A 2 ) as one would ex- 
pect from Eq. ([1]). If prices followed a Brownian motion, 
the q-th quantile (T q ) of the first passage time distribu- 
tion would be 



T - A 2 
9 ~ 2 ( 7 2 [erf C - 1 (<?)]' 



(4) 



where the median (M [FPT]) corresponds to q = 0.5. In 
reality, the power law behavior with A is less evident, 
as shown by the inset of Fig. [1] Assuming a behavior 
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Pttf(£)i -Pttff (i) j and -PttcMj respectively. 



price, A=1 
bid, A=1 
ask, A=1 
Brownian 
fit of Eq. (6) 




slope -1.5 
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Figure 1: First passage time distributions for the price, bid 
and ask quotes of GlaxoSmithKline (GSK), distance A = 1 
tick. The dotted line is the first passage time distribution 
for Brownian motion with volatility a = 1/7 pennyx sec _1//2 . 
The thick solid line is a fit with Eq. (|3]) as given in Table [T] 
The inset shows the median first passage time as a function 
of A. 



M [FPT] cx A v would require an exponent varying be- 
tween 1.5 and 1.8 depending on the specific stock and 
the precise range of A used for the estimation of n. A 
similar deviation from the prediction of Brownian motion 
was reported in Ref. (3?| in the analysis of closure index 
values sampled at a daily time horizon. 

There are many differences between real prices and 
Brownian motion, and the above non-quadratic behav- 
ior can come from any of them: the non-Gaussian distri- 
bution of returns, the superdiffusivity of price, perhaps 
both or none. We have performed a series of shuffling ex- 
periments and preliminary results support the conclusion 
that the main role is played by the deviation from Gaus- 
sianity. This non-Gaussianity is well documented in the 
literature down to the scale of single transactions [l§] ■ A 
similar effect was seen for Levy flights, whose increments 
are also very broadly distributed, and their value of r\ can 
be different from 2, and it is related to the index of the 
corresponding Levy distribution [38j . 



IV. TIME TO FILL, TIME TO CANCEL 

For an executed order the time elapsed between its 
placement and its complete execution is called time to 
fill. Orders are often not executed in a single transaction, 
thus one can also define time to first fill, which is the 
time from order placement to the first transaction this 
order participates in. Finally, for canceled orders one 
can define the time to cancel which is the time between 
order placement and cancellation. The distribution of 
these three quantities will be in the following denoted by 



A. Properties of the distributions 

As a first characteristic of the order book, we inves- 
tigate the distribution of time to fill and time to cancel 
for the stocks in our dataset. Fig. [2] shows these distri- 
butions for GlaxoSmithKline (GSK) for different values 
of A. Similarly to the first passage time, we fitted the 
empirical density with the function 



TTF 



■A(t) 



C't 



U-\n 



1 + [V7ttf(A)]- Attf 



(5) 



This form ([5]), which we used to fit the FPT in the 
previous Section, is different from the more familiar gen- 
eralized Gamma distribution used in Ref. Q . The reason 
for our choice is that we concentrate on the tail behavior 
of time distributions. According to our measurements the 
FPT, TTF and TTC distributions have fat tails, which 
can be well described by power laws. The generalized 
Gamma function has too slow convergence to a power 
law to describe the observed tails in the time range of 
our investigations. A detailed discussion of this problem 
is provided in Appendix [Bj 

We also emphasize that in the present study we do 
not intend to discuss in detail the behavior on short time 
scales. We assume that this regime is simply charac- 
terized by the exponent A' only to perform a quick and 
efficient fit. This choice will have no direct relevance to 
our main conclusions, which always apply to the tails of 
the distribution. 

Nevertheless, in addition to the very good fit at large 
times the above formula gives for some cases an overall 
good description also at short times. Table HT1 shows the 
results for all five stocks. We find that Attf, which gives 
the asymptotic behavior of the distribution, ranges be- 
tween 1.8 and 2.2 for up to A = 4 ticks. This is greater 
than the value Ref. @] found for NASDAQ. The expo- 
nent A T tf varies between —0.4 and 0.6. Finally Tttf 
typically grows with A, as orders placed deeper into the 
book are executed later. We will return to this obser- 
vation in Section IIV CI For A > 4 the small number 
of limit orders in our sample does not allow us to make 
reliable estimates for the shape of the distribution. Fig. 
[5] also gives a comparison of four further stocks (AZN, 
LLOY, SHEL and VOD) to show that our findings are 
quite general. The distribution of time to first fill is in- 
distinguishable from time to fill. 

For time to cancel one finds a similarly robust behavior, 
also shown in Fig. [2j Its distribution is again well fitted 
by the form 



P 



TTC;A 



(*) 



(Jtt £ — Attc 



i + [t/r T Tc(A)]- A ™+A^ 



(0) 



where the long time asymptotics has an exponent Attc 
ranging between 1.9 and 2.4. Unlike the case of Attf, the 
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stock 


A = 1 


A = 2 


A = 3 


A = 4 


A A' T 


A A' T 


A A' T 


A A' T 


AZN 
GSK 
LLOY 
SHEL 
VOD 


1.50 0.14 58 
1.52 0.16 62 
1.54 0.22 85 
1.52 0.20 83 
1.57 0.43 150 


1.50 0.22 140 

1.52 0.18 230 
1.55 0.20 280 

1.53 0.27 160 

1.54 -0.19 450 


1.50 0.18 240 

1.50 -0.02 390 
1.53 0.01 460 

1.51 0.02 360 
1.49 -0.69 720 


1.49 0.11 350 
1.48 -0.21 520 
1.51 -0.12 630 
1.51 0.00 450 
1.51 -0.66 1500 



Table I: Parameters of the fitting function (|3} for the distribution of first passage time for the five stocks. A is measured in 
ticks and all times are given in seconds. Typical standard errors for the quantities: ±0.05 for Afpt, ±0.05 for A FPT , and ±10% 
for Tfpt- 



measured values of of Attc & r e in agreement with those 
measured in Ref. [1] for NASDAQ. All results concerning 
the time to cancel are given in Table IIIIl 

As for the FPT, for both TTF and TTC the asymp- 
totic power law behavior and the value of exponents is 
preserved if time is measured in transactions, see Ap- 
pendix [AJ 

B. Comparison of characteristic times 

The emphasis of this paper is on the interplay between 
order execution, order cancellation and the first passage 
properties of price. To understand this relationship, con- 
sider the following argument proposed in Ref. Q . Imag- 
ine that there are no cancellations. Let a buy order be 
placed at the price bo — A, when the current best bid is 
at bo (the argument goes similarly for sell orders). How 
much time does it take until this order is executed? It 
is certain that the order cannot be executed before the 
best bid decreases to bo — A, because until then there 
will always be more favorable offers in the book. On the 
other hand, once the price decreases to bo — A — e where e 
is the tick size of the stock, it is certain, that all possible 
offers at the price bo — A have been exhausted, including 
ours. Therefore both time to fill and time to first fill for 
any order placed at a distance A from the best offer is 
greater than the first passage time of price to a distance 
A, and less than that to A±e. Since this is true for every 
individual order, one expects the following inequality for 
the distribution functions of characteristic times: 

/ PfPT;A (t')dt' > f P T TF;a(*'K> 

Jo Jo 

f PTTFF;AW> / P F p TA +e(t')dt'. (7) 
JO Jo 

Using the empirical distributions above, a straightfor- 
ward calculation yields 

Afpt = -^ttff = ^ttf> (8) 

which is in clear disagreement with the data, where pro- 
nouncedly Afpt < Attf ~ Attff- This inequality for 
the tail exponents means that one finds less orders with 
very long time to (first) fill than expected. The resolu- 
tion of this apparent contradiction is that cancellations 



have to be taken into account: Orders which would have 
to wait too long before being executed are often canceled 
and thus removed from the statistic. The measurement of 
the cancellation time distribution suffers from the same 
bias. The observed distribution of time to cancel does 
not characterize how traders would actually cancel their 
orders, because here the executed orders are missing from 
the statistics. 

In Section [V] we will present a simple model that gives 
insight into the features pointed out so far. However, 
before doing so, we would like to present one further point 
concerning the empirical data. 

C. The role of entry depth 

How do order execution times change as a function of 
the entry depth A? Similarly to first passage times, the 
empirical distributions found for time to fill/cancel have 
a slowly decaying tail such that the means might diverge. 
Therefore, in the following we will use the medians of all 
quantities as a measure of their typical value. 

In Fig. [3] we show that the median of time to fill is 
empirically well described by 

M [TTF] oc A 1 ' 4 , (9) 

which is quite different from the M [TTF] cx A 2 expected 
naively from Eq. ([4]) and a Brownian motion assumption, 
and also from the A 15 " 18 behavior observed for the first 
passage time. We show that cancellations play an impor- 
tant role in these discrepancies. 

Let us make a surrogate experiment with the data of 
the stock GSK. We select all filled orders, and from the 
time of their placement we calculate the first time when 
the transaction price becomes equal to or better than 
the price of the order. If one plots the median of this 
quantity versus the A of the orders, the resulting curve 
is indistinguishable from the median of time to fill [Fig. 
©left), curve labeled as "TTF/FPT filled ord"]. Thus the 
exponent 1.4 does not come from the difference between 
order executions and first passage times. 

In another surrogate experiment we keep the time of 
order placements, but shuffle the A values between or- 
ders. This way we destroy correlations between volatility 
and order placement. We record the corresponding first 
passage times. The resulting curve is labeled as "FPT 
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Figure 2: Top left: distribution of time to fill of GSK for A = 1 ... 4 ticks, and fits with Eq. Ipa). The dashed line is a power law 
with exponent —2.0. Top right: distribution of time to cancel of GSK for A = 1 ... 4 ticks, and fits with Eq. (|6]). The dashed 
line is a power law with exponent —2.0. Bottom left: comparison of distributions of time to fill for three typical stocks, A = 1 
tick, and fits with Eq. ©. The dashed line is a power law with exponent —2.0. Bottom right: comparison of distributions of 
time to cancel for three typical stocks, A = 1 tick, and fits with Eq. (|6}. The dashed line is a power law with exponent —2.0. 



stock 


A = 1 


A = 2 


A = 3 


A = 4 


A A' T 


A A' T 


A A' T 


A A' T 


AZN 
GSK 
LLOY 
SHEL 
VOD 


2.0 -0.0 65 
1.9 -0.2 68 
2.0 -0.1 85 
1.9 -0.1 77 
1.8 -0.4 190 


1.9 0.0 100 
1.9 -0.2 150 
1.9 -0.1 160 
1.9 -0.2 110 
1.8 -0.5 490 


1.8 -0.0 120 

1.8 -0.4 190 

1.9 -0.2 240 
1.9 0.0 270 
1.8 -0.4 980 


1.9 0.0 200 

1.8 -0.3 320 

1.9 -0.2 350 
1.8 -0.1 250 



Table II: Parameters of the fitting function <(Sj for the distribution of time to fill for the five stocks and A > ticks. All 
times are given in seconds. Data are missing where the statistics was inadequate for fitting. Typical standard errors for the 
quantities: ±0.1 for Attf, ±0.1 for A^tfj an d ±10% for Tttf- 



snuff, all ord". This new curve now agrees with the first 
passage time of price [curve "FPT, price (book only)"] 
when A > 8 ticks, which corresponds to a median time 
of about 1 — 2 hours. The origin of the anomalous A- 
dependence is, at least in the large A case, therefore the 
presence of cancellations. The explanation of other con- 



tributions requires more involved arguments which are 
beyond the scope of this paper. 

The dependence of median time to cancel on the en- 
try depth A has a less clear functional form, as shown 
by Fig. 2J While M [TTC] appears to be a monotoni- 
cally increasing function of A, the curves for the different 
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stock 


A = 1 


A = 2 


A = 3 


A = 4 


A A' T 


X X T 


X X T 


X X T 


AZN 
GSK 
LLOY 
SHEL 
VOD 


2.2 0.6 87 

2.2 0.5 110 

2.3 0.5 130 

2.4 1.1 150 
2.0 0.9 300 


2.2 0.6 90 

2.0 0.5 90 

2.2 0.4 140 

2.3 1.1 140 

2.1 0.8 1000 


2.2 0.6 85 
1.9 0.5 94 
2.0 0.4 120 

2.3 1.1 68 
2.2 0.6 1500 


2.2 0.7 100 
1.9 0.6 170 
2.0 0.5 250 
2.2 1.0 56 
1.9 0.5 1000 



Table III: Parameters of the fitting function (|6]) for the distribution of time to cancel for the five stocks and A > ticks. All 
times are given in seconds. Data are missing when there were no orders at all, or the statistics was inadequate for fitting. 
Typical standard errors for the quantities: ±0.1 for Attc, ±0.1 for Atto an d ±25% for Tttc- 



stocks show only a qualitative similarity. One of the rea- 
sons may be that different cancellation mechanisms are 
treated together. 



V. A SIMPLE MODEL OF THE 
CHARACTERISTIC TIMES 

The problem of the interplay between time to fill and 
time to cancel is an example for competing risks (IH [3^ | . 
In this framework mutually exclusive events are consid- 
ered in time (lfl . |30| : in our case after its placement a 
limit order is cither executed or canceled. Each of these 
events has its own probability distribution for the time 
when it will occur, but only the earliest one of the events 
is observed. In this section we present a simple joint 
model 7 of limit order placement and cancellation that is 
of this type. We will see that the model gives predictions 
that can be tested against real data. Moreover, it also 
gives indications on the statistical properties of a quan- 
tity that is directly unobservable: the "lifetime" an agent 
is willing to wait for a limit order to be executed. 

We make the following assumptions: 

1. We consider one "representative agent" [4l|. At 
time t = the agent places a single buy 8 limit 
order at a A > distance from the current best of- 
fer. (A generalization to A < is given in Section 
I VIII ) We treat all the other market participants on 
an aggregate level. 

2. The agent is not willing to wait indefinitely for the 
order to be executed. Instead, at the time of place- 
ment the agent also decides about a cancellation 
(or more appropriately expiration) time t' for the 
order. This is a value drawn randomly from the dis- 
tribution Plt;a(^')- We will call this function the 
lifetime distribution. If the order is not executed 
until t', then the order is canceled. The agent has 



Ref. [40ll shows that similar arguments give a very good approx- 
imation for the average shape of the order book. 
Note that throughout the paper we use the language of buy or- 
ders, but analogous definitions can be given for sell orders. All 
measurements include both buy and sell orders. 



no additional cancellation strategy. This assump- 
tion is very restrictive (cf. Ref. [3), but as Section 
IVHI Al will show, it does not affect our results sig- 
nificantly. 

3. The market is very liquid and tick sizes arc small. 
As a consequence, 

(a) before its execution, the effect of the agent's 
limit order on the evolution of the market 
price is negligible. This point neglects that 
traders reveal private information about their 
valuation of the stock by placing limit orders. 

(b) the interval between the time when the best 
bid reaches the order price and when the 
agent's order is executed is negligible. Wc 
also assume that such immediate execution is 
independent of the volume of the agent's or- 
der. A simple way to motivate that the vol- 
ume present at a given price does not strongly 
affect execution times is to measure the typi- 
cal ratio between time to fill and time to first 
fill as a function of the volume of the order. 
For at least 75% of the orders of any volume 
this is close to 1. The only exceptions can 
be very large orders with A = 1. Here the 
price reaches the order quickly, but it takes 
about 20% longer to execute it completely (see 
also Ref. 0]). Moreover, for real limit orders 
the median time to fill does not depend too 
strongly on the volume of the order, except 
for very large volumes, see Fig. [5j 

In our study we included SHEL and VOD which are 
known to have large tick/price ratios, so Assump- 
tion 3 would be invalid. Contrary to our expecta- 
tions, wc did not find any indication of anomalies 
like in other studies [HI, 53, EH, 5H , and the model 
proved useful for these stocks as well. 

Under our assumptions one can write a joint density 
function that describes both the price diffusion process 
and cancellations. The probability P&(t, f) that the price 
reaches an order placed at a distance A > from the 
current best offer at time t (and then it can be executed 
immediately), and that the agent decides to cancel the 
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Figure 3: Left: The dependence of characteristic times on the entry depth A for GSK. Empty squares show time to fill, the 
curve for the surrogate first passage time assigned to filled orders is indistinguishable from this one. The dotted lines are 
M [■] oc A ' and M [■] oc A 1 ' 9 . Right: The dependence of median time to fill on the entry depth A for four stocks. The curves 
show a strong similarity and they are significantly better fitted by M [TTF] oc A 1 ' 4 than by the naive ansatz M [TTF] oc A 2 
from Brownian motion. 



o 



10 4 



10 



<5 1CT 



10 1 



— □ — AZN 




— ° — GSK 




— a — LLOY 




SHEL 




— VOD 









10 u 



10 1 
A (ticks) 



10' 



Figure 4: The dependence of median time to cancel on the 
entry depth A of the limit order. The curves have an increas- 
ing tendency and they are qualitatively similar across stocks. 
However, they do not follow any obvious functional form. 



order a time t' can be written as a product of two inde- 
pendent distributions: 



Pa(M') = PFPT;A(t)PLT;A{t') 



(10) 



For each limit order values of t and t' are drawn from P. 
The limit order is executed if t < t' or it is canceled if 
t > t' . The two cases arc illustrated in detail in Fig. [51 



VI. THE PREDICTIONS OF THE MODEL 

Competing risk models are often estimated by the pro- 
cedure introduced by Kaplan and Meier This is 
a statistically consistent, non-parametric method to es- 
timate the marginal distributions Pfpt and Plt from 
Pttf and Pttc under the assumption that execution 
and cancellation are independent as we already assumed 
in writing Eq. (flT)]) . We will now calculate these esti- 
mates in another, but strictly equivalent analytical way. 

Let us denote distribution functions as follows: 



PX:A(>t) 



Px- A (r)dT, 



(11) 



where X can be any process introduced above (FPT, LT, 
TTF, TTFF, TTC). We will omit the lower index A for 
brevity. Let us first express the previously introduced 
quantities in terms of the joint probability Pa(£, i') and 
via Eq. (flT)]) . For executed orders t < t', thus the distri- 
bution of time to fill is given by 



Pttf (£) 



PFPTft)P LT (> t) 

Jo° Pfpt(t)P LT (> r)dr 

Af[P F PT(t)PLT(> t)}. 



(12) 



We introduced the operator Af[-], which normalizes a 
function to an integral of 1. Symmetrically for time to 
cancel t < t'\ 



PTTGit)= r?M>t)PMt) = 

[ ) JT^ptO r)P hT (r)dr 

Af[P FPT (> t)P LT (t)}. 



(13) 



As (fT2"]) and (fT3^) are two equations with only one un- 
known function, namely the lifetime distribution Plt (t) ; 
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volume volume 



Figure 5: Left: median time to fill for AZN and GSK, A = 1 and 2 ticks, as a function of order volume. The value does not 
depend strongly on order size except for very large orders. Right: median time to cancel for AZN and GSK, A = 1 and 2 ticks, 
as a function of order volume. The value does not depend strongly on order size. 




Figure 6: Left: The scheme of the diffusive model for execution times (example with buy orders). Orders are indicated by 
thick horizontal lines. The order is placed at a distance A below the current best bid. At the time of its placement the order is 
assigned a lifetime (the length of the thick line). If the bid crosses the line, then the order is executed at the time of crossing. 
The time between order placement and the crossing is the first passage time of the bid price to a distance A. If there is no 
crossing, the order is canceled at its cancellation time (the end of the thick line). Right: Orders are executed when the first 
passage time is less, and canceled when larger than the intended lifetime. 



one can calculate that from, e.g., Eq. (fT2| . and then see if 
the solution is consistent with Eq. (fT3|) . We can express 
from Eq. (O, that 



Plt(> t) (x 



-Pttf (t) 
PfptW 



and thus 



d 



PLT(t) = -—PLT(> t) = ~N 

at 



d PttfW 



(14) 



(15) 



dtP F v T (t)_ 

It is also possible to estimate the same quantity directly 



fromEq. (fT3f : 



Put (t) = JV 



P 



TTC 



(*) 



(16) 



P FPT (> t)_ 

Let us eliminate the lifetime distribution, and substitute 
the large t asymptotic power law behavior of all proba- 
bilities. After simple calculations one finds that 

Attf = Attc- (17) 

Then we substitute this result back into Eq. (fT4|) to find 
that the lifetime distribution also has to decay asymptot- 
ically as a power law: 



(18) 



10 



with 



A 



LT 



A 



TTF 



A 



FPT 



1 = A 



TTC 



A 



FPT 



1. 



(19) 



Eq. (fTT|) is in good agreement with the results of Sec- 
tion |IV] where Attf = 1-8 - 2.2, and A T tc = 1-9 - 2.4. 
This is a clear improvement compared to Eq. ([8]). The 
introduction of the simplest possible cancellation model 
gives a good prediction for the difference between the ex- 
ponents describing the asymptotics of the first passage 
time and time to fill. 

Moreover, one can now observe the hidden distribution 
of lifetimes. By substituting the typical values into Eq. 
(|19[) . one gets Alt ~ 1-6. In comparison, a paper by Bor- 
land and Bouchaud [45[ describes a GARCH-like model 
obtained by introducing a distribution of traders' invest- 
ment horizons and the model reproduces empirical val- 
ues of volatility correlations for Alt = 1-15, which is not 
far from our estimate. More recently it has been shown 
[46| that the limit order price probability distribution is 
consistent with the solution of an utility maximization 
problem in which the limit order lifetime is power law 
distributed with an exponent Alt — 1.75. The origin of 
the power law distribution of limit order lifetimes is not 
clear. Unfortunately the data do not allow us to separate 
individual traders. Therefore we do not know whether 
such a result arises from the broad distribution of the 
time horizons of each trader, or simply a distribution of 
traders with different investment strategies. Based on an 
empirical investigation at the broker level, in Ref. [46[ it 
is argued that heterogeneity of investors could be the de- 
terminant of the power law lifetime distribution. Notice, 
however, two points: (i) We are not speaking about how 
long the investors hold the stock. Instead, Plt is the 
distribution of how long investors are willing to wait for 
their limit orders to be executed and before they cancel 
or revise their offers, (ii) None of the limit orders we are 
discussing here are truly long-term. Even the orders with 
relatively long lifetime spend at most a few days in the 
book. 



VII. AN EXTENSION TO A < 

So far we only considered orders with prices which were 
worse than the best offer at the time of their placement, 
i.e., A > 0. However, this group only accounts for less 
than half of the actual limit orders. Measurements for 
A < orders give the surprising result that these execu- 
tion times are described by statistics very similar to those 
for A > 0. One example stock (GSK) is shown in Fig. 
[TJleft). The results of our fitting procedure performed 
with Eq. ([5]) are given in Table ITVl for all five stocks. 

According to our model, these orders should have been 
executed within a negligible time of their placement. 
While this is true for a number of them, certainly not for 
all. Let us assume that we are placing a new buy limit 
order. If our order has A = 0, then it will be among the 
best offers at the time of its placement. If our order has 



A < 0, then it becomes the single best offer in the book, 
and hence it will trade with certainty if the next event is 
a buy market order. Why can our order still take a long 
time before being executed? The answer is naturally that 
before our order is executed, a new buy limit order may 
enter the book. If this new order has A < (where A 
now has to be measured from our order), it means that 
it has an even better price than our order and it will 
gain priority of execution. On the other hand, our order 
now effectively has A > 0, and the original model can be 
applied. 

In order to test such a hypothesis, we carried out the 
following calculation. For the sake of simplicity, we will 
consider the time to first fill instead of time to fill. Sec- 
tion [V] argued that for the majority of orders the differ- 
ence between the two is negligible. From the time of its 
placement, we tracked every single at least partially filled 
A < order until the time it was first filled. We defined 
the reduced entry depth (A') and the reduced time to 
first fill (TTFF') for these orders as follows 

1. For orders, where from their placement to their first 
fill there were no even more favorable orders both 
placed and then at least partially filled, A' = and 
TTFF' = TTFF. 

2. For orders where after their placement but before 
their first fill there was at least one new, more fa- 
vorable order introduced with A nGW < and then 
this new order was at least partially filled, we se- 
lected the first of such new orders placed after the 
original one and set A' = — A ncw . Thus, A' is the 
new position of the original order, after the new 
one was placed. TTFF' is defined as the time to 
first fill of our order measured from the placement 
of this new order. 

The typical distribution of TTFF' for different groups 
in A' is shown in Fig. [7K right). For orders with A' = 
this is - except for here uninteresting very short times 
- well described by a stretched exponential distribution 

r i /2~ 

Pttff'(^) — lk cx P — (|) " ■ These arc the orders, 
where there was no better offer made, and hence their 
execution times were purely determined by the incoming 
market orders. The distribution is very close to the distri- 
bution of the times between two consecutive transactions 
of the stock [see Fig. EJright)]. 

For orders with A' > 0, one recovers the results of the 
previous sections, and the distribution of reduced time 
to first fill asymptotically decays as a power law with a 
power close to 2.0. Eqs. (fT7| and (flU))) are expected to 
be valid for orders with A < and A' > as well, given 
that we use them in terms of A' and TTFF'. 

As a summary, time to first fill for orders with A < 
is a two-component process. If there is no better order 
placed before the first fill, then time to first fill is basi- 
cally identical to the waiting time distribution between 
opposite market orders. If there is a better offer submit- 
ted, then the order effectively becomes A > 0, and the 
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stock 


A = 


A= -1 


A = -2 


A A' T 


A A' T 


A A' T 


AZN 
GSK 
LLOY 
SHEL 
VOD 


2.2 0.6 110 
2.2 0.5 110 
2.2 0.5 120 
2.2 0.4 110 
2.1 0.5 160 


2.2 1.0 230 
2.1 1.1 180 
2.1 1.0 150 
2.1 1.0 120 
2.0 1.1 130 


2.1 1.2 320 

2.2 1.3 410 

2.0 1.2 220 

2.1 1.0 140 



Table IV: Parameters of the fitting function ([5} for the distribution of time to fill for the five stocks and A < ticks. All times 
are given in seconds. Data are missing when there were the statistics was inadequate for fitting. Typical standard errors for 
the quantities: ±0.1 for Attf, ±0.1 for Attfi an d ±25% for Tttf- 



stock 


A = 


A = -l 


A = -2 


A A' T 


A A' T 


A A' T 


AZN 
GSK 
LLOY 
SHEL 
VOD 


2.3 0.6 130 

2.1 0.6 130 

2.2 0.5 120 

2.3 1.0 220 
2.0 0.7 200 


2.0 0.7 90 
1.9 0.7 90 
1.9 0.7 70 
2.2 1.0 130 
1.8 0.8 120 


1.9 0.8 72 
1.8 0.8 50 
1.8 0.9 85 
2.0 1.0 160 



Table V: Parameters of the fitting function (|6]) for the distribution of time to cancel for the five stocks and A < ticks. All 
times are given in seconds. Data are missing when there were no orders at all, or the statistics was inadequate for fitting. 
Typical standard errors for the quantities: ±0.1 for Attc, ±0.1 for AxtCi an d ±25% for Tttc- 



diffusion approximation applies. As this latter process 
has a much fatter tail than the former one, long waiting 
times and the tail exponent of the joint process are again 
dominated by a first passage process. 



VIII. DISCUSSION 
A. Lifetime distribution 

Before discussing the results let us analyze the most 
important simplifying assumption of our model, namely 
the independence of the lifetime of the order from the 
evolution of price. This would mean that traders decide 
about an expiry time of their limit orders at the time 
of their placement, and then do not cancel them earlier, 
which resembles the random cancellation process as in- 
troduced in Ref . [l3[ . In order to see the relevance of our 
assumption one should calculate the cross-correlation co- 
efficient of first passage times and the lifetime process. 
However, as mentioned in Section [V] we are limited by 
the fact that the lifetime is hidden. It is not possible to 
calculate cross-correlations between time to fill and time 
to cancel either, because for the same order one cannot 
observe both variables. This issue is related to the iden- 
tifiability problem of competing risks [30| . 

We suggest the following approach to resolve the above 
issue: Let us consider canceled orders only. There one 
can observe the values of the lifetime, because they were 
realized as an actual time to cancel. Moreover, our model 
assumed, that the order would have been executed at the 
first passage time (the time of the first transaction at the 
order's or a better price). Now it is possible to quantify 
cross-correlations between these two quantities, but one 
has to keep in mind three points. (Note that we will 



consider orders with A = 1 to have the largest possible 
sample.) 

1. For very short times the price dynamics is domi- 
nated by bid-ask bounce, and other non-diffusive 
processes [47j. Our model is not valid in this 



regime, because rapid order executions are not gov- 
erned by a first passage process. Hence we discard 
all orders which were canceled within L = 4 min- 
utes of their placement. 

2. In order to avoid problems arising from the possible 
non-existence of the moments of the distributions, 
we choose to evaluate Spearman's rank-correlation 
coefficient 9 (p), instead of Pearson's correlation co- 
efficient. The quantity p has further favorable sta- 
tistical properties, for example it is not very sensi- 
tive to extreme events. 

3. As we can only consider canceled orders, we know 
that FPT > LT. This constraint alone, and regard- 
less of the choice of correlation measure, will cause 
strong positive correlations between the two quan- 
tities. Even if FPT and LT are independent, the 
conditional joint distribution reads 



P(FPT = t, LT = i'|FPT > LT) = 

Af{e(t~t')p FPT (t)p LT (t% 



(20) 



9 This is defined by first, for both quantities separately, replacing 
each observation by its rank in the sample (i.e., assigning 1 to the 
largest observation of first passage time, 2 to the second largest, 
etc., and then repeating the procedure for lifetimes). Then the 
usual cross-correlation coefficient is calculated for the ranks |48| . 
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Figure 7: Left: Examples of the distribution of time to fill/cancel for A < buy limit orders of GSK. Right: The distribution 
of reduced time to first fill (TTFF') as a function of the reduced entry depth A'. For orders where A' = 0, the tail of the 

distribution is well fitted by the stretched exponential ^ exp (§) 1/ '*'J ■ Where A' > 0, the distribution decays asymptotically 

as a power law with an exponent close to —2.0. The solid line is the distribution of waiting times between two consecutive 
trades of GSK. 



where is the Hcavisidc step function. Due to our 
restricted observations this is clearly not a product 
of two independent densities. 

Instead, a more convenient null hypothesis is to 
measure the correlations between FPT/LT and LT. 
L = 4 min was chosen such that for A = 1 the dis- 
tribution of the first passage time is well described 
by the power law 

P FPT (t\t>L)~ ^L-i *"^ (21) 

If FPT and LT are independent, then 

P (FPT/LT = x, LT = i'|FPT > LT) = 

Af{0(x - l)P FPT (xt')P hT (t')} = 
M[e(x - 1)x" Afpt ] x jV[.FfcpT (t')PuT (£')]• ( 22 ) 

Eq. (|2"Tj) was used for the second equality. The final 
result is a product form in functions of x and of 
t', which means that FPT/LT is independent from 
LT, given that we restrict ourselves to FPT > LT. 
Remember that the only assumption for this result 
is that first passage times are asymptotically power 
law distributed, which seems to hold very well in 
our data down to L « 4 min. 

We calculated Spearman's rank correlations between 
FPT/LT and LT in our restricted sample for various 
stocks, this we will denote by p rcs - Results are summa- 
rized in Table fVTl One finds negative correlation between 
the two quantities at all usual significance levels. 10 This 



means that those limit orders that would have been exe- 
cuted later were canceled earlier, i.e., that traders update 
their decision on when to cancel a limit order by track- 
ing the price path. This is in line with the results of 
Ref . [HI • To prove that this value of p truly comes from 
correlations, we generated surrogate datasets by random- 
izing the pairs FPT/LT and LT while keeping the con- 
straint FPT > LT. According to Table [VI] this com- 
pletely destroys the correlations between FPT/LT and 
LT, p BUIT = 0. 

It is important to remember that this value of p Ies is 
not the actual correlation coefficient between the first 
passage time and the lifetime process. To quantify the 
true value of cross-correlations, we introduce p t ruc which 
is Spearman's rank-correlation coefficient between LT 
and FPT. While this cannot be measured directly, there 
is a procedure to estimate it from a known value of p ros 
based on Monte Carlo simulation. Let us assume that 
FPT and LT are adequately described by power law dis- 
tributions with the known tail exponents. We model the 
cross-correlation between the two processes by copulas 
(see Ref. [50]). Morgenstern's copula reads 

P(> t, > t') = P FPT (> t)P LT (> t') 
{1 + 3 Ptruc [l - P F pt(> t)][l - Plt(> 0]} - ( 23 ) 

with some —1/3 < ptruc < 1/3, while Frank's copula 



The error bars were estimated by the bootstrapping procedure 



suggested in Ref. [4il (for more details see Refs. therein). 
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P(> t,>t') = - In 



1 



(e 



aPppT (>t) 



l)(e 



«Pit(X') 



1) 



e Q - 1 

(24) 

with some — oo < a < oo. Here P(> t, > t') = 
J t dr J., dr'P(r, r') which is the joint distribution func- 
tion. 

Monte Carlo measurements based on random pairs 
from these copulas suggest a nearly linear relationship be- 
tween the true and the restricted correlation coefficients. 
With the substitution of the typical values of Afpt and 
Alt one finds that 



Ptrue — r X /? res , 



(25) 



where r ps 1.66 for Morgenstern's and r w 1.55 for 
Frank's copula. The resulting estimates are given in Ta- 
ble IVIl Naturally, the shuffled surrogate datasets yield 

Ptrue — /^rcs — 0. 

These calculations have shown that there is a strong 
negative correlation between the first passage time and 
the lifetime of an order in agreement with Ref. [l4| but 
contrary to our model assumption 2 and Eq. (JTUJ) . So 
the key question is: How much does the presence of this 
correlation affect the predictions of our model? We per- 
formed a series of Monte Carlo simulations of the execu- 
tion and cancellation processes by using the empirically 
observed value of tail exponents and cross correlations 
(Table [VI]). We found that for a fixed value of Afpt and 
Alt the introduction of such correlations increases the 
values of Attf and Attc by about 0.1, which is compa- 
rable to the error bars of our estimates, and the power 
law behavior is well preserved. Moreover, the central 
part of our arguments, Eq. (fl7|) . remains valid. Thus 
the presence of a dynamic cancellation strategy does not 
significantly affect the validity of our model. 



these characteristic times are informative of the inter- 
dependence of order executions and cancellations. These 
observations are quite robust and can be seen as "stylized 
' facts" characterizing the order book. 

We did not find significant difference between the be- 
havior of buy and sell orders, in contrast with Refs. @,[5l| 
for US markets, but in accord with Ref. [ll[ for the 
case of Ericsson stock traded at the Stockholm Stock Ex- 
change. We are therefore not able to conclude whether 
the symmetric behavior we observe in the London Stock 
Exchange is common to most markets or specific to some 
of them or to certain time periods. 

In addition to the empirical findings summarized in 
Tables U |TT] and IIIII we introduced a model, where order 
execution times are related to the first passage time of 
price, and orders are canceled randomly with lifetimes 
that are asymptotically power law distributed. This can 
be considered as the simplest possible model to take can- 
cellations into account. In this framework we showed that 
the characteristic exponents of the asymptotic power law 
behavior of the first passage time, the time to (first) fill 
and time to cancel are related to each other by simple 
rules which are in agreement with our empirical observa- 
tions. These results are in contrast with another study 
(the NASDAQ data investigated in Ref. Q). Therefore 
further investigations are needed to clarify whether or 
not our findings are market specific. 

The observed heterogeneity of cancellation times may 
be driven by traders having different time horizons or 
by traders following different cancellation strategies in 
different market environments. Methods that can dis- 
criminate between these mechanisms represent a major 
objective for future research. 
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B. Conclusions 
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Appendix A: RESULTS IN TRANSACTION TIME 

The typical time between transactions strongly de- 
pends on market conditions and it is very far from strictly 
stationary. This fact, also closely related to volatility 
clustering, could influence the distribution of first passage 
times, time to fill and time to cancel. Many recent studies 
measure time in transactions in order to remove fluctu- 
ations in trading activity. In order to better understand 
the role of activity fluctuations, we repeated our calcula- 
tions in transaction time, but we did not find any changes 
that affect the conclusions of our paper. Fig. [5] shows 
comparisons between real time and transaction time for 
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the probability distributions of FPT, TTF and TTC (for 
the stock GSK, A = 1 tick). The short time regime 
is quite different, while for long times the fluctuations 
in trading activity are less relevant, and all the distri- 
butions remain power laws asymptotically. The changes 
in the values of the tail exponents are also small. The 
bottom right panel of Fig. [5] compares Pfpt, Pttf and 
Pttf in transaction time. Our arguments still hold, as 
Afpt < Attf ~ Attc- 



Appendix B: FITTING FUNCTIONS FOR THE 
DISTRIBUTIONS OF TIME TO FILL AND TIME 
TO CANCEL 

In this Appendix we present a critical discussion re- 
garding our decision to fit the empirical distributions of 
FPT, TTF and TTC with the functional form presented 
in Eq. ([5]). In our preliminary investigations, we fitted 
the distribution of FPT, TTF and TTC with two differ- 
ent distributions. The first one was the one we consider 
throughout the paper, i.e., 



Pz(t) = 



at 



1 + (t/T)-^' 



(Bl) 



The second one was the generalized Gamma distribution 

Pa{t) foo ' (B2) 

which has been used in some of the existing studies on 
TTF (e.g. Ref. [§]). Another common form, the Weibull 
distribution, is a special case of Eq. (|B2[) for k = 1 . Our 
empirical analysis shows that the Weibull distribution 
fits the data poorly and it will not be considered in this 
Appendix. For large values of t the density of Eq. (|Bip 
behaves as 



Pz{t) 



1 



(B3) 



The asymptotic behavior of Pg (t) depends on the sign of 
the parameter p. If p < (as for the investigated data) 
it can be written as 



Pc{t) 



exp(-c/tl p l) 

fl+\p\n 



(B4) 



where c is a constant. Thus the generalized Gamma 
distribution, similarly to Eq. (|B1[) . is consistent with 
a power law tail, although it is modulated by an expo- 
nential function which becomes less and less important 
as t — > oo. In order to estimate the optimal parameters 
of the distributions we used a Maximum Likelihood Esti- 
mator (MLE). For illustrative purposes, here we consider 



the case of TTF for AZN and A = but the results are 
similar for other stocks, other values of A and for both 
TTF and TTC. 

Fig.Efleft) shows the distribution of TTF for AZN and 
A = together with fits by Eqs. (JBTJ) and i[B2]l . Both 
Pz and Pq give a good fit both in the tail and in the 
body of the distribution. One finds that Pg has a slightly 
larger likelihood C than Pz- Since the two distributions 
have the same number of parameters (degrees of freedom) 
the likelihoods can be compared directly. However if one 
computes the tail exponents of the distribution from the 
fitted parameters one finds a puzzling result. The tail ex- 
ponent obtained from the generalized Gamma distribu- 
tion fit is 4.5, whereas the tail exponent obtained from the 
Pz fit is 2.2. Such a difference in the exponent should be 
detectable in data. Still, Fig.OJleft) shows that both dis- 
tributions fit the tail reasonably well. The reason of this 
contradiction is shown in the inset of Fig. [SJlcft). This 
plots the local tail exponent of the generalized Gamma 
distribution, given by d[\ogPG(t)]/d[logt], as a function 
of t. The local exponent of the generalized Gamma dis- 
tribution converges extremely slowly to the asymptotic 
value 4.5 and in the range of the TTF from 10 3 to 10 4 
the local exponent is between 2 and 3, which is approxi- 
mately consistent with the values obtained from Pz- 

As we have repeatedly stated, in this paper we are 
interested in the tail behavior of the distribution of the 
time to fill and time to cancel. The analysis summarized 
in Fig.^left) shows that the parameters estimated from a 
fit to a generalized Gamma distribution are not suitable 
to estimate the tail exponent of the distribution, or at 
least not in the regime of TTF and TTC values that can 
be explored within our datasct. In other words, even 
if the generalized Gamma distribution gives a (slightly) 
better fit in terms of likelihood, it is hard to estimate the 
tail exponent from the fitted parameters due to the slow 
convergence of the local exponent. On the contrary, the 
parameters estimated from the fit with functional form 
of Eq. (|B1|) give a better estimate of the tail exponent. In 
order to support this claim, we estimate independe ntly 
the tail exponent by using the Hill estimator [52|, 1 53t | - 
In Fig. fright) we show the Hill plot of the time to fill 
of AZN with A = 0. It is clear that the Hill estimator 
converges to a value which is much closer to 2.2 (as in the 
Pz distribution) than to 4.5 (as in the Pg distribution). 

In conclusion our analysis shows that, although the 
generalized Gamma distribution gives a slightly better 
overall fit of time to fill and time to cancel than our pro- 
posed form [Eq. (|B1[) ]. the parameters obtained from the 
fit of Pg suggest an unrealistic value of the tail exponent. 
On the contrary, our function Pz allows us to both fit the 
data reasonably well and to obtain values of the tail ex- 
ponent which are consistent with the Hill estimator. 
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Figure 8: Comparison of the distribution of characteristic times in real time and transaction time, all plots are for GSK, A = 1 
tick. As a reference two power law decays with exponents 1.5 and 2.1 are also given. Top left: FPT, Top right: TTF, Bottom 
left: TTC, Bottom right: all three quantities, only in transaction time. 
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TTF fraction of points 



Figure 9: Left: Probability density function of the time to fill of limit orders with A = for AZN (boxes). We also show 
the Maximum Likelihood best fit according to the generalized Gamma distribution Pg of Eq. IB2I (red dashed line) and to the 
functional form Pz of Eq. IB II used in this paper (solid blue line). The inset shows the local tail exponent of the generalized 
Gamma distribution (d[log Pc(t)]/d[\og t]) as a function of t. Right: Hill plot of the same data used in the left panel, showing 
the estimated tail exponent of the probability density function as a function of the fraction of points used in the estimation 
(black line). The dashed red line and the solid blue line are the values of the exponent obtained from the fit by Pg and Pz, 
respectively. 



